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ABSTRACT 

In a situation where the population distribution of 
latent trait scores can be estimated, the ordinary maximum likelihood 
estimator of latent trait scores may be improved upon by taking the 
estr.raated population distribution into account* In this paper 
empirical Bayes estimators are compared with the liklihood estimator 
for three samples of 300 cases within the context of the Rasch model* 
The empirical Bayey estimates varied more than the likelihood 
estimates, due to the fact that not only item parameters, but also 
population distribution parameters had to be estimated from the 
sample data, the largest difference being 0*09 for a score equal to 
20* The results were based on a computer simulation for which the 
model aud distributional assumptions were known to be correct* For 
real data, there is a question of appropriateness of the 
distributional assumptions — apart from the question of fit of the 
Rasch Model, The normality assumption can be tested by means of the 
test of fit proposed by Andersen and Hadsen* If the normality 
assumption turns out to be inadequate, other distributions aay be 
fitted to the data using an approach similar to that proposed in this 
paper* fAnthor/EL) 
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In a situation where the population distribution of latent trait scores 
can be estimated, one may improve upon the ordinary maximum likelihood 
estimator of latent trait scores by taking the estimated population 
distribution into account. In the present paper empirical Bayes 
estimators are compared with the likelihood estimator within the context 
of the Rasch model, fhe data are simulated, with latent traic scores 
pen^iiated from a normal distribution. 

Introduction 

In a latent trait model persor, parameters or abilities can be estimated 
by the method of maximum likelihood, given the item parameters or 
estimates of the item parair.eters. The ML-method has some disadvantages, 
however. First, person parameters cannot be estimated for persons with 
a zero or perfect score, the estimates tending to ninus and plus 
infinity, respectively; in the three parameter logistic model also 
other score patterns have no unique maximum of the .^l^elihood equation 
(Samejima, 1973). Secondly, in case the persons can be regarded as 
randomly sampled from some population of persons, an empirical Bayes 
estimate of persons' abilities can be obtained which has a smaller 
expected mean squared error and is preferable for that reason; in 
the same way interval estimates for abilities can be obtained which 
compare favorably with intervals obtained through the information 
function. 



Paper presented at the European Meeting of the Psychometric 
Society, Groningen, June 19 thru 21, 1980. The author is indebted 
to Eric A. Bakker who wrote the computer programs. 



Bayesian and empirical Bayes procedures have been proposed betoro within 
the context of latent trait theory. Birnbaum (1969) and - in connection 
with tailored testing - Owen (1969; see also Jensema, 1974) have 
proposed a Bayesian analysis for the logistic and normal ogive latent 
trait models. Leonard (1972) has presented a Bayesian analysis for the 
binomial model using the 'log-odds' transformation; his model can be 
regarded as a special case of the Rasch model with all item parameters 
equal to zero. An empirical Bayes point estimate of ability has been 
derived by Meredith and Kearns (1973) for the multiplicative version 
of the Rasch model; the advantage of their approach is that no specific 
assumptions with respect to the population distribution have to be made 
and that the estimation of the population distribution can be bypassed. 
Sanathanan and Blumenthal (1978) have presented an empirical Bayes 
procedure for the additive Rasch model; they assume that the person 
parameter density belongs to a specific family of densities. In the 
present paper an empirical Bayes proccnire for the additive version 
of the Rasch model is proposed, assuming normally distribution abilities. 



The posterior distribution in the Rasch model 
The additive Rasch model reads 

(1) P(u. = l |e)=exp (e-b.)/ [ l+exp(0"b.^ 

where 6 is the parson parameter or ability, b. is the item parameter or 
difficulty of item i and P(u.=l|6) is the probability of a correct 
answer on Item i, given ability 6. Given the item parameters 
b=(bi ,b2, ' . . ,b^) or good item parameter estimates for the items of 
a n-item test, the likelihood of having t items correct equals 



1 



- 1 



(2) Lj^(t|e)=Yj. exp(te) n [l+expO-b.)] 

i=l . 

V 

where Yj. is the elementary symmetric function of order t of the n 
factors exp(-h. ) . 

Let us assume that the person parameters can be considered as randomly 
sampled from a population distribution g(6), i.e., gvO) is the prior 
distribution for the O's. Using Bayes' rule we obtain the posterior 
distribution of 9 



Equation (3) is similar to Birnbaum's ( 1969) Equation (A), In the present 
paper it is assumed that g(9) is a normal distribution (()(0|y,a^). 

The above model is not quite realistic in that a known population 
distribution is assumed. The more general approach is to assume that 
prior infonnation on y and is avail "^ible and to combine this 
information with data from the sample of persons in order to obtain 
posterior estimates of y and 0^. In an empirical Bayes approach only 
sample information is used in order to obtain the parameters o€ the 
population distribution. 

The normal population distribution 

In the logistic and normal ogive models the ability scale has interval 
characteristics, i.e. the models are invariant up to a linear transform- 
ation. For this reason one can choose the representation for which 
9 is N(0,1). Item parajieters have to be estimated on this particular 
scale. Bock and Lieberman (1970) present an estimation procedure for the 
two-parameter normal ogive model, Bock (1972) presents one for the 
logistic model without guessing parameter. An heuristic procedure for 
the three-parameter ogive model has been proposed by Urry (1976). 

The same could be done in connection with the Rasch model if one allows 
the common discrimination parameter to be unequal to one. We will, 
however, confine ourselves to the Rasch model proper in which only 
translations of the ability scale are allowed. 

For the Rasch model a procedure for the estimation of \i and of the 
population distribution has been presented by Andersen and Madsen (1977). 
They start with the marginal frequencies f(t), t=0,l,...,n, from (4) 
assuming accurately estimated item parameters. The lil^elihood of 

N scores t for t = 0,1,..., n in a sample ofN= Z N persons from the 

^ , . , . t=0 

population equals 



(3) f(0|t) 



f(t) 



. exp(tO) 



whoro 
(4) f(t) 



In f i+cxp(o-b.)l-" 

J 

- ^Yf. exp(te) ' n [ l+exp(e-b.)]| "*g(0)de . 



(5) L. - r [f(t|n,a')] 



where ((> denotofl the nomml frequency diHtribution. 

EgtimutQg of 0 and aro gbtainad by dtf f urontiating the logarithm of 
likelihood (5) with rcspocC to \.\ and and setting the results equal 
to zero: 



^^°«S.4> 3f(t) 
(6) — - E— ^ - 0 



ay f(t) 3y 



and 



^^°8S,<j) 9f(t) 

(7) — - i: 0 

90^ f(t) 90^ 

The derivatives equal 



9f( 



(8) — - J[(Q-M)/a^] exp(t0) |n [ l+exp(6-b^)]| *(ji(6)d0 

and 



-1 



9f(t) / r 1 r " r l' 

(9) = / i[(e-y)Va'*-Va2j exp(t0) |n I l+exp(0-b.)J 

30^ / I i=l 



"'(()(0)d0 



From (4), (8) and (9) it is clear that the elementary symmetric functions 
which do not contain 9, do not have to be computed. Equations (6) and 
(7) can be solved iteratively for p and by the Newton-Raphson method; 
in order to be able to compute the derivatives the integrals have to be 
approximated by sums. Andersen and Madsen also presents a likelihood 
ratio test for fit. 



Point and interval estimates for the person parameters 

The likelihood equation for the estimation of 0, given a score equal to 
t, equals 

9logl^(t|0) 

(10) h(0) = ^ = t - 2 exp(0-b.) [l+exp(0-b.) = 0 

O 99 III 
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A person parometor estlmato 5^ in obtained by aolving (10) intcrntlvoly 
for 0. An interval ostimnto of 0 - under the aflBumption of app /ox i junto 

A 

normality of the distribution of the likelihood estimate 0 and the 
assumption that the information function is fairly constant in t\\Q 
neighbourhood of the likelihood estimate - can be obtained from tha 
information function through 



(II) var (6) 



I (0)" 



n cxp(8-b.) 

? , l+exp(0-b.) 
1-1 ^ 1 



-I 



An empirical Bayes point estimate of 0 is the posterior mean 
(12) - E(0|t) - y0f(0|t)d0 



where f(0|t) is obtained from (3) replacing g(C) by the estimated 
population distribution* The posterior variance can be computed as 

(13) var (0|t) - E(e2|t)-E2(e|t) - f(0|t)de -^^^ . 



Under the assumption that the posterior distribution can be approximated 
by a normal distribution (12) and (13) can be used in order to obtain a 
posterior confidence interval for 9» 

Equations (12) and (13) also can be used for the estimation of y and 
by means of a procedure due to Sanathanan and Blumenthal (1978). From 
the equation one obtairs 

n 

(14) y = Z (N , ) E(e|t) 
t=0 'n 

and 



(15) + Ci^ = Z(N / . E(02|t;). 

Choosing initial values for y and a^, E(6|t) and E(6^|t) can be computed 
for t=0,...,n. Throsgh Equations (lA) and (15) one obtains new values 
for y and a^. One can repeat the procedure until convergence is obtained. 
As a by-product one obtains the posterior means and variances. The 
algorithm is simple in that no first and second order derivatives of 
f(t) have to be derived, but it also is computationally slow. 
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- u ~ 

Inatcad of thu poatorlot moan, tl)^' poatorlor mode 0^ rould bo uHod nn a 
point tsfltimato of 0, Tlio mode can bu obtninod by solvinR 

(16) k(0) - r - J:|oxp (0-b.) [ l"fexp(0 'b, )]''^ J - cr^(Q-M) - 0 

for 0 ; the tirst part of Equation (16) cqualn tho likelihood oqnation, the 
second part stems from tho normal population distribution. The modnl 
estimate is obtained by iterativcly solving (16) for 0. One may use the 
maximum likelihood estimate 0 as a starting value. Using a two-term 
Taylor expansion of (16) one then obtains 

(17) k(0) k(6) + (0-6) 

— (e-G) [1(6) +^]-o 

due to the fact that h(§) (cf. Equation (10)) equals zero. Equation (17) 
can be rewritten as 

(18) 6 - pg 6 + (1-pg) M 
where 

pg = a' [a' + (S) . 

Equation (18) clearly indicates that the empirical Bayes estimates 
are regressed to the mean; Equation (18) is a Kelley-f ormula for the 
estimation of 9. Estimate (i8), being the estimated 6 after the first 
iteration, may of course still differ to a certain extent from the 
final modal estimate 6. 

Also for the m'^dal estimate 9 an interval estimate can be obtained 
using 

(19) vSr (9|t) - [ 1(0) + a"^] . 
A simulation 

In order to demonstrate the feasibility and usefulness of empirical 
Bayes estimation within the context of the Rasch model, a simulation 
study was performed. It was assumed that the population distribution 
was N(0,1). Further, the hypothetical test was assumed to consist of 
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twenty Items I with all Item partimolorH tsquMl to at?ro, 
Throci hundred person paramotorH wcare rniulomly Renernted from t\w 
diHtrlbution N(0>l), Next, for ^nch hypothetlcnl purnon p « ncore pattern 
was generated with a probability of n earreet answer c«p(Op) [Hrxp<0p)j'"^ 
Cor each item. The 300 x 20 data matrix watt analystud UHing tlie uncondi- 
tional procedure for the oatimation of porHon and item pnrameterH from 
BICAL, written by Wright and Mead (cf. Wright and Stone, 19/9), The 
procedure contuinH a correction for binn in the item parameter e«timateH 
ari«ing in the situation where both seta of parameters nre estimated 
simultaneously; this bias was proved by Andersen (1973) for the cnse of 
two items, 

The item parameter estimates, based on the 293 cases with scores 
I x ^ 19, ranged from -,II3 to .182. Clearly regressed item parameter 
estimates have more optimal characteristics even if one does not invoke 
the concept of a distribution from which items are randomly sampled 
(Efron and Morris, 1975). In this paper we will, however, make use of 
the ordinary me*.* 'mum likelihood estimates of the item parameters. The 
estimated population mean and variance, using the Andersen and Madsen 
procedure, are {j « 0,047 and » 1.108, The person parameter estimates 
6, 6 and 6 are given in Table I, The posterior means were also computed 
by means of the Sanathanan and Blumenthal procedure; they are not 
presented here while t.iey were virtually identical with the estimates 
from the Andersen and Madsen procedure. 

For X equal to zero or twenty no maximum likelihood estimate is possible. 
In order to compiire the maximum likelihood and the empirical Hayes 
estimate, 'maximum likelihood' estimates for scores t « 0 and t = 20 
were computed by substitution of scores t = 0.5 and t = 19,5 respectivf^ly 
in the likelihood equation. 



Tabol !• PerHon paramoter entimatefl 



flcorc 


ft 
0 


0 


0 


0 


-3.67 


-2.29 


-2.21 


1 


-2.79 


-1.93 


-1 .87 


2 


-2.08 


-1.63 


-1.58 


3 


-1.65 


-1.37 


-1.33 


A 


-1.32 


-I.IA 


-1 . 10 


5 


-1 .OA 


-0.92 


-0.90 


6 


-0.80 


-0.72 


-0.70 


7 


-0.59 


-0.53 


-0.52 


8 


-0.39 


-0.35 


-0.3A 


9 


-0. 19 


-0.17 


-0. 17 


10 


0 


0.01 


0.0 


1 1 


0.19 


0.18 


0. 17 


12 


0.38 


0.36 


0.3A 


13 


0.59 


0.55 


0.52 


lA 


0.80 


0.7A 


0.70 


15 


l.OA 


0.9A 


0.90 


16 


1.32 


1 .16 


1.10 


17 


1 .65 


1 .39 


1 .33 


18 


2.09 


1.66 


1.58 


19 


2.79 


1 .96 


1.87 


20 


3.67 


2.32 


2.21 
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It la clear from Tfible I clwit th^ empirical My^n t>oint tJstimHU^s arf 
r©gr©fl»«(l to the wean, the poBterior mode ullKhtly more thrtn the 
poHterior mean eatimatea. Further, the re8re««i«n in stronRont for 
extroae ebaerved acorns* The eo»fiti^nce intervals based on th? pmpifiral 
UayoH approach, aro amaUor than the (nterval« hawed on the Intormailon 
function, a« can be seen from Table 2. Afiain, the empirical llayew 
approach hao the largoHt effect for extreme ticore». 



Table 2. EHtimatcd varianceti 



s cqito 




var vt' 1 


0 


i • U J 


U . Jo 


1 

1 


u. y 1 


t\ 1 o 


2 


0 ^ 1 


N 
U • 


3 


n T 7 


i\ 0 c 


A 




A 0 

U « Z.: 


5 




Urn l\ 


6 




u. ly 


7 




U« 1 7 


D 
O 


0. 2 1 


0. 18 


9 


o.;o 


0.18 


10 


0.20 


0. 18 


M 


0.20 


0. 18 


12 


0.21 


0.18 


13 


0.22 


0. 19 


lA 


0.23 


o.:o 


15 


0.26 


0.21 


16 


0.30 


0.23 


17 


0.37 


0.25 


18 


0.51 


0.28 


19 


0.91 


0.33 


20 


2.05 


0.39 



In order to compare the effectiveness of the three different point 
estimates, the mean squared error loss in the sample was computed. 
This was done for all 300 cases and for the subgroup with scores 
1 thru 19. The results are given in Table 3. Between brackets the 
results of two other simulations with 300 cases are presented. The 
empirical Bayes estimators 



TttbU 3, Mean ^t^a^red Ptr^t l^^^ 



0 .3U3 (.asa, .286^ 

^ •225 (•tpq, .209) 

6 .227 (.182, .213) 

^nn^n with Kcpr^tt I thru 
6 .276 (.221, .244) 

^ ,224 (.181, .196) 

0 .224 (.184, ,200) 

outweigh the mflximum Uki^lihood awtimntor, with i\ wUnht udv^ntaHt;^ fur 
thct puflterior mc»fln fi« 0xp<?cted with the cricitrion of miuux Hqurtrtul 
error Iobh. It ih clvixr that the mW^ntttp^k' of tht^ iimplric/tl \U\yvn 
oatiroatorn in dut^ In large p^irt to tha atronu rtrgri^HMton for 
extreme 0, 



Diiicuiiftion 

Pcr»on parameter cstimatcH which are rogresHed to the mean /ire on ihe 
average more accurate than ordinary maximum likelihood estimnten. 
In this paper thin in demanstrnted for three Hamplen of 300 ca8?« within 
the context of the Rasch model. The empirical Baye« entiroatea varied 
more than the likelihood estimates, due to the fact that not only item 
parameters, but also population distribution parameters had to be 
estimated from the sample data, the largest difference being CO for a 
score <(\jal to 20, In applications of course new samples should be 
combined with old samples in order to improve all parameter estimates. 
The results were based on a computer simulation for which the model 
and distributional assumptions were knowm to be correct. For real dara 
- apart irom the question of fit of the Rasch model - there is a 
question of appropriateness of the distributional assumptions. It 
remains to be examined how robust the results are when the normality 
assumption is violated. The normality assumption can be tested by means 
of the test of fit proposed by Andersen and Madsen, If the normality 
assumption turns out to bi: inadequate, other distributions may be 
fitted to the data using a similar approach as proposed in this paper. 

Interestingly, the fact that regressed estimates of person parameters 
are more efficient than the maximum likelihood estimator, is similar 
to regression effects in classical test theory. Regression effects 
will be larger with short tests and/or a small population variance 



(m i^nm\^ip the littler i^^np ^{vt*n liy Wiuid, l^^7M), u U ^t**f*r 

ftrmipN uf p^ifiiontt dlff<ar in iniri^l ^sUm^tisa ^biUM?»§, it not 
(ig^^iliU to obtain smmU i?unt lu^ilons if mw th^^s not f^iHount for 
r(^gr^i4iiion ^fft^etii. 
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